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Abstract — Families of asymptotically regular LDPC block 
code ensembles can be formed by terminating (J, A') -regular 
protograph-based LDPC convolutional codes. By varying the 
termination length, we obtain a large selection of LDPC block 
code ensembles with varying code rates, minimum distance that 
grows linearly with block length, and capacity approaching 
iterative decoding thresholds, despite the fact that the terminated 
ensembles are almost regular. In this paper, we investigate the 
properties of the quasi-cyclic (QC) members of such an ensemble. 
We show that an upper bound on the minimum Hamming 
distance of members of the QC sub-ensemble can be improved 
by careful choice of the component protographs used in the code 
construction. Further, we show that the upper bound on the 
minimum distance can be improved by using arrays of circulants 
in a graph cover of the protograph. 

I. Introduction 

II Low-density parity-check (LDPC) codes [1] based on a 
protograph [2] form a subclass of multi-edge type codes that 
have been shown to have many desirable features, such as 
good iterative decoding thresholds and, for suitably-designed 
protographs, linear minimum distance growth (see, e.g., [3]). 
Analogously, ensembles of LDPC convolutional codes [4], the 
convolutional counterparts to LDPC block codes, can also be 
constructed using protographs and display the same desirable 
properties (see [5] and [6], respectively). 

So-called asymptotically regular LDPC block code ensem- 
bles [7] are formed by terminating (J, iC) -regular protograph- 
based LDPC convolutional codes. This construction method 
results in LDPC block code ensembles with substantially better 
thresholds than those of (J, K)-regulai LDPC block code en- 
sembles, despite the fact that the ensembles are almost regular 
(see, e.g., [7]). These codes were analysed further in [8] and 
were also shown to have minimum distance growing linearly 
with block length, i.e., they are asymptotically good. As the 
termination length tends to infinity, it is further observed 
that the iterative decoding thresholds of these asymptotically 
good ensembles approach the optimal maximum a posteriori 
probability (MAP) decoding thresholds of the corresponding 
LDPC block code ensembles. More recently, this property has 
been proven analytically in [9] for the binary erasure channel 
(BEC) considering some slightly modified ensembles. 

Members of the protograph-based LDPC code ensemble that 
are quasi-cyclic (QC) are of great interest to code design- 
ers, since they can be encoded with low complexity using 
simple feedback shift-registers [10]. Moreover, QC codes can 

This work was partially supported by NSF Grants CCF08-30650, DMS- 
0708033, and TF-0830608 and NASA Grant NNX09AI66G. 



be shown to perform well compared to random codes for 
moderate block lengths [11], [12]. However, unlike typical 
members of an asymptotically good protograph-based LDPC 
code ensemble, codes from the QC sub-ensemble cannot be 
asymptotically good. Indeed, if the protograph base matrix 
consists of only ones and zeros, then the minimum Hamming 
distance is immediately bounded above by {uc + 1)!, where 
Tic is the number of check nodes in the protograph [13], [14]. 
In this paper, building on recent results by Smarandache and 
Vontobel [15], we show that the upper bound on the minimum 
Hamming distance of members of the QC sub-ensemble of 
asymptotically regular (J, K) LDPC codes can be improved 
by careful choice of the component protographs used in the 
code construction. Even though we show that the QC codes 
from the ensemble are not 'typical', we see that constructions 
that improve the ensemble minimum distance growth rate also 
increase the upper bound on minimum distance for members of 
the QC sub-ensemble. In addition, for several of the examples 
given in the paper, QC codes are constructed that achieve 
this upper bound. Further, we show that the upper bound 
on minimum distance can be improved by using arrays of 
circulants in a graph cover of the protograph. 

II. Analysis of protograph-based LDPC codes 
A protograph is a small bipartite graph B = {V, C, E) that 
connects a set of n„ variable nodes V — {uq, . • . ,i'n„-i} to 
a set of ric check nodes C = {cq, . . . , c„^_i} using a set of 
edges E. The protograph can be represented by a parity-check 
or base biadjacency matrix B, where B^^y is taken to be the 
number of edges connecting variable node Vy to check node 
Cx- Figure [T] shows an example of an irregular protograph with 
repeated edges and the associated base matrix. 
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Fig. 1: An example of a protograph and the associated base 
matrix. 

This protograph is called irregular because both the variable 
and check node degrees are not constant. 

An ensemble of protograph-based LDPC block codes can 
be created from a base matrix B using a copy-and-permute 



operation [2]. A parity-check matrix H from the ensemble of 
protograph-based LDPC block codes can then be obtained by 
replacing ones with an A^ x A^ permutation matrix and zeros 
with the Af X A^ all zero matrix in the base matrix B. In the 
case when a variable node and a check node are connected 
by r repeated edges, the associated entry in B equals r and 
the corresponding block in H consists of a summation of r 
N X N permutation matrices. The ensemble is defined as the 
set of all possible parity-check matrices H that can be formed 
using this method. 

By construction, every code in the resulting ensemble has 
the same node degrees and structure. The ensemble design rate 
is given as i? = 1 — nc/ny In addition, the sparsity condition 
of an LDPC matrix is satisfied for large A^. The code created 
by applying the copy-and-permute operation to an n^ x n„ 
protograph base matrix B has block length n = Nriy. 

A. Density evolution for protograph-based ensembles 

Since every member of the protograph-based ensemble pre- 
serves the structure of the base protograph, density evolution 
analysis for the resulting codes can be performed within the 
protograph. In this paper, we assume that belief propagation 
(BP) decoding is performed after transmission over a BEC 
with erasure probability e. Let p*^*' denote the probability that 
the incoming message in the previous update along an edge 
of an arbitrary check node is an erasure. Then the density 
evolution threshold of an ensemble is defined as the maximal 
value of the channel parameter e for which p^*' converges to 
zero for all edges as the number of iterations i tends to infinity. 

B. Weight enumeration for protograph-based ensembles 

The preserved structure of members of a protograph-based 
LDPC code ensemble also facilitates the calculation of average 
weight enumerators. An ensemble average weight enumerator 
Ad tells us that, given a particular Hamming weight d, a typical 
member of the ensemble has Ad codewords with Hamming 
weight d. Combinatorial techniques for calculating enumera- 
tors for protograph-based ensembles have been presented in 
[3] and [16]. The weight enumerator Ad can be analysed 
asymptotically to test if the ensemble is asymptotically good. 
If this is the case, then we can say that the majority of codes 
in the ensemble have minimum distance growing linearly at 
least as fast as nSmin, where dmin is the minimum distance 
growth rate of the code ensemble [3]. 

III. Quasi-cyclic protograph-based LDPC codes 

One of the main advantages of quasi-cyclic LDPC codes is 
that they can be described simply, and as such are attractive for 
implementation purposes (see, e.g., [10]). In this section, we 
focus on the quasi-cyclic sub-ensembles of protograph-based 
ensembles of LDPC codes and review the existing literature 
that will be used to analyse these ensembles. 

A. Structure of QC sub-ensembles 

Given a protograph base matrix B, a parity-check matrix 
H from the ensemble of protograph based codes Cb(^) is 
created by replacing each non-zero entry r with a summation 
of r non-overlapping permutation matrices of size N x N and 



eeg'^w. 



replacing zeros with the N x N all-zero matrix. The quasi- 
cyclic sub-ensemble, denoted ^g (A^), is the subset of parity- 
check matrices from ^b(^), where each of the permutation 
submatrices are chosen to be circulant. The notation la is used 
to denote the N x N identity matrix with each row cyclically 
shifted to the left by a positions. The set of all such matrices 
comprise the circulant subset of the set of N x N permutation 
matrices. When applying the copy-and-permute operation, by 
restricting the choice of permutation matrices to come from 
this subset, the resulting parity-check matrix H will be quasi- 
cyclic, i.e., H e ^g (A^) C ^b(^)- For example, a quasi- 
cyclic parity-check matrix can be formed from the base matrix 
defined in Figure [T] as 

' h+h h ■ 

H = /s /lo + 1-20 I9 + -^18 I7 

.0 hi /23 + /17. 

When considering a sub-ensemble such as ^3 (A^), one has 
to be careful with the relevance of asymptotic results obtained 
for the ensemble ^b(-^)- As N ^ 00, if the probablility of 
choosing a member of the sub-ensemble is non-zero we say 
that the code is a typical member of the ensemble. By this 
definition, it is clear that the sub-ensemble ^g (A^) contains 
atypical codes. This follows since there are only A^ out of A^! 
permutations that are circulant, i.e., the fraction of choices of 
permutation matrices that are circulant is N/Nl ~ 1/(A^— 1)!, 
which tends to zero as A^ — J> 00. Then, if the base matrix 
B contains only ones and zeros, the fraction of codes in the 
ensemble that are circulant is (1/(A^ — 1)!)''' where k is the 
number of ones in B. Repeated edges in B further reduce this 
fraction. 

B. Minimum Hamming distance bounds for QC sub-ensembles 

If the base matrix B contains only ones and zeros, then 
it is well known that the minimum Hamming distance of 
any code from the quasi-cyclic sub-ensemble of protograph- 
based LDPC codes can immediately be bounded above by 
(ric + 1)! [13], [14]. This result was improved and extended 
by Smarandache and Vontobel to base matrices with entries 
larger than one [15]. Let the permanent of an m x m matrix 
B be defined as 



perm 
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where we sum over the m! permutations <t of the set 
{1, . . . , m}. Then the minimum distance of a code drawn from 
the QC sub-ensemble can be upper bounded as follows; 

Theorem 1: Let C be a code from ^3 (A^), the quasi-cyclic 
sub-ensemble of the protograph-based ensemble of codes 
formed from base matrix B. Then the minimum Hamming 
distance of C is bounded above a^ 

drnm(C) < min* V'perm(Bs\i), (1) 

SC{l,...,rit,}^ 
|S|=n, + l ^^■^ 

where perm(B5yj) denotes the permanent of the matrix con- 
structed as the Tie columns of B from the set S\i. 

'The min*{} operator returns the smallest non-zero value from a set. In 
this context, if the all-zero codeword arises from a constructed matrix, this 
operator ensures that is disregarded as an upper bound in the minimization. 



IV. Terminated protograph-based LDPC 

CONVOLUTIONAL CODES 

A rate R = b/c (time-varying) binary LDPC convolutional 
code [4] can be defined as the set of infinite binary sequences 
V[-oo.oo] that satisfy the equation V[_oo,ooiHy 
where 



— oo.oo 



= 0, 



H 



— 00,00 



alio) 



aUms) 



Ho^(t) 



llj(t + m,) 



is the transposed parity-check matrix, also called the syndrome 
former matrix. The binary {c ~ b) x c submatrices Hi(t), i = 
0, 1,--- jTOs, satisfy the conditions that H„j^(i) j^ for at 
least one t € Z and that Ho(t) has full rank for all t. We 
call 771s the syndrome fanner memory and Us = {mg + 1) • c 
the decoding constraint length. These parameters determine 
the width of the nonzero diagonal region of H[_tx) oqJ. The 
sparsity of the parity-check matrix is insured by demanding 
that its rows have Hamming weight much less than Vg. The 
code is said to be regular if its parity-check matrix H[_oo,oo] 
has exactly J ones in every column and K ones in every row. 

A. Constructing protograph-based LDPC convolutional codes 

Analogously to block codes, an ensemble of LDPC con- 
volutional codes can be constructed from a protograph. We 
proceed by forming a time-invariant infinite base matrix^ with 
component be x b^ submatrices Bq, Bi, . . . , B,„^ as follows: 



Br 



B, 



Bn 



B, 



Bo 



(2) 



The infinite Tanner graph associated with 'Q[-ao,oo\ can be 
regarded as a convolutional protograph. An ensemble of 
time-varying LDPC convolutional codes can be formed from 
B[_oo,oo] using the protograph construction method based on 
N X N permutation matrices described in Section |ll] Given a 
base matrix B, one can form a convolutional protograph with 
the same rate and degree distribution by creating the submatri- 
ces Bo, Bi, . . . , Bm^ using an edge-spreading technique [7]. 
Here, the edges of the protograph base matrix B are spread 
over the component submatrices such that Bo + Bi + . . . + 
B„i^ = B. Note that the submatrices necessarily have the 
same size as B. 

B. Forming terminated protograph-based LDPC convolutional 
codes 

Suppose that we start the base matrix defined in (J2| at time 
t = and terminate it after L time instants. The resulting 
finite-length base matrix is given by 

-If the base matrix contains only ones and zeros, it represents tlie parity- 
clieck matrix of a rate R = 1 — bc/bv time-invariant convolutional code with 
syndrome former memory m^ ■ 



B 



Bn 



B, 



[0,L-1] - Bo 



(3) 



™- (L+ms}b^xLb„ 

The matrix B[o.l-i] can be considered as the base matrix 
of a terminated protograph-based LDPC convolutional code 
ensemble. Termination in this fashion results in a rate loss. 
The design rate Rl of the terminated code ensemble is equal 
to 



Rl = i- 



L 



be 

by 



1 



L + 771 s 

L 



(1-i?), (4) 



where R — 1 — Nbc/Nb^ = 1 — bc/bu is the rate of the 
unterminated LDPC convolutional code ensemble. Note that, 
as the termination factor L increases, the rate increases and 
approaches the rate of the unterminated LDPC convolutional 
code ensemble. In addition, as L — > oo, the degree distribution 
approaches that of the unterminated ensemble. It follows that 
if the base matrix B is (J, if) -regular, and we apply the 
edge spreading technique to preserve the structure, the degree 
distribution of the terminated ensemble approaches that of a 
(J, iiT) -regular ensemble as L — > oo, i.e., it is asymptotically 
regular The protograph-based LDPC block code ensemble 
associated with B[o_i_i] can be studied using the analysis 
discussed in Section HIl 

V. QC ASYMPTOTICALLY REGULAR LDPC CODES 

In this section, we form families of asymptotically regular 
LDPC block code ensembles by terminating (J, if) -regular 
protograph-based LDPC convolutional codes. It was shown in 
[8] that the minimum distance growth rates and the iterative 
decoding thresholds of asymptotically good terminated ensem- 
bles are sensitive to the choice of component protographs 
used in the edge spreading technique. Here, we investigate 
how the choice of component protographs affects the upper 
bound on the minimum Hamming distance of the QC sub- 
ensemble ^3 (N). Even though the QC codes are not 
typical members of the ensemble, we observe that choosing 
component submatrices that yield strong ensemble minimum 
distance growth rates also gives large upper bounds on the 
minimum distance of the QC sub-ensemble. Further, we show 
that by using arrays of circulants, which can alternatively be 
viewed as the QC sub-ensemble arising from a graph-cover 
of the protograph, we can increase the upper bound on the 
Hamming distance of codes chosen from ^g (N). 

To begin, we compare different edge spreadings that result 
in asymptotically regular (3, 6) ensembles. 

Example 1: Consider spreading the edges of the base matrix 
B = [ 3 3 ] into component submatrices 



Br 



1 1 



Bi — B2 



where Bq + Bi + B2 = B. Using these component submatri- 
ces, we can obtain the base matrix for a (3,6)-regular LDPC 
convolutional code ensemble with syndrome former memory 



rus = 2. The terminated ensembles in this family were 
shown to be asymptotically good with thresholds converging 
to the (optimal) MAP decoding threshold e* = 0.4881 for 
(3, 6) -regular LDPC codes on the BEC as L ^ oo [8]. 
For termination factor L = 4, the ensemble has design rate 



(4) 



0.0814, 



i?4 = 1/4, minimum distance growth rate S[ 
and BEC iterative decoding threshold e* = 0.635. Terminating 
after L = 10 time instants, the rate increases to Riq = 2/5, 
the minimum distance growth rate is <^,„j„ = 0.0258, and 
the threshold is e* = 0.505. As L -^^ oo, the minimum 
distance growth rate tends to zero and the threshold converges 
to e* = 0.488 (close to the Shannon limit Ssh = 0.5 for rate 
i?oo = 1/2). 

Using Theorem [T| and the base matrix B[o,2] {L = 3), 
we calculate that the minimum Hamming distance for the 
circulant sub-ensemble ^g (N) is bounded above by 56, 
i.e., d„imQC' < 56 for any circulant size N. To show that 
this upper bound is indeed achievable, consider the following 
parity-check matrix: 

h h 

I 5 ho 

H = 1-25 Il9 




With circulant size A^ = 49, this parity-check matrix defines a 
[294, 51, 56] QC binary linear code with girth 8 (in this case, 
H has 2 redundant rows). Note that, for typical codes from 
the ensemble ^Btq ,, (-^), the (asymptotic) minimum distance 
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growth rate is 5,^ 



(3) _ 



0.1419. 



For termination factors L > 3, the upper bound d 



miiiQC 



< 



56 remains constant. It follows that this is also an upper 
bound on the free distance of the circulant sub-ensemble of 
protograph-based LDPC convolutional codes, i.e., dfreeQC < 
56. In addition, as the termination length of the convolutional 
protograph increases, the asymptotically regular ensembles 
display capacity approaching iterative decoding thresholds. 
Even though these thresholds are not achievable with QC 
codes because small cycles exist in the Tanner graph, we 
expect that QC codes drawn from ensembles with a better 
iterative decoding threshold will display better performance 
in the waterfall region of the bit error rate curve, even for 
finite block lengths (see, e.g., [17]). In practice, the design 
parameter L adds an additional degree of freedom to existing 
block code designs. Starting from any LDPC block code, it 
is possible to derive terminated convolutional codes that share 
the same encoding and decoding architecture for arbitrary L. 
Example 2: Let B be the all-ones matrix of size 3x6. 
Consider the following edge spreading of B: 

"1110 

Bo = 1110 

111 

Using Bo and Bi as given above, the asymptotically regu- 
lar (3, 6) ensemble defined by (O has six degree 3 check 
nodes and 3L — 3 degree 6 check nodes for termination 
factors L > 2. The protographs in this terminated family 
will be highly regular with no degree 2 check nodes. The 
family of terminated (3, 6)-regular LDPC convolutional code 



and Bi = B Bo 



ensembles resulting from this edge spreading were shown 
to have increased minimum distance growth rates and BEC 
thresholds when compared to equal rate ensembles from the 
family defined in Example 1 (see [8]). For example, for L = 2, 
i?2 = 1/4, (S^L = 0.0920, and e* = 0.6471. The improved 
minimum distance growth rates are reflected in the upper 
bound on codes chosen from the QC sub-ensemble. For this 
family, we calculate dminQC < 176 for L > 2. 

Example 3: We now consider a 'bad' example of edge 
spreading. Consider the following component matrices ob- 
tained by edge spreading the all-ones base matrix B of size 
3x6: 

'1110 

1110 

111 



B 



— 



and Bi = B Bo 



This ensemble has relatively poor iterative decoding thresholds 
and minimum distance growth rates compared to the other 
asymptotically regular (3, 6) families. The BEC thresholds 
for this family converge to 0.4734 (compared to 0.4881 for 
the other asymptotically regular (3, 6) examples), and for 
L = 2, when i?2 = 1/4, the minimum distance growth 



rate is just S, 



(2) 



0.0296 with threshold e* 



0.4949. 



When calculating the upper bound on the minimum distance of 
members of the QC sub-ensemble ^^ (A^) for this edge 

spreading, we note that, for any termination factor L, after 
some row permutations the ensemble contains the following 
sub-structure: 
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which limits the circulant minimum distance to d 



'iniriQC 



<36. 



This small upper bound for the QC sub-ensemble reflects the 
poor ensemble minimum distance growth rates. 

Example 4: As a final asymptotically regular (3, 6) example, 
the edge spreading [8] 

B = [ 3 3 ] -> Bo = [ 2 1 ] and Bi = [ 1 2 ] . 

was shown to result in a family with the largest minimum 
distance growth rates of all the asymptotically regular (3,6) 
families considered. In addition, for small values of L, the 
thresholds were shown to be the same as or larger than 
other asymptotically regular (3, 6) ensembles of the same rate, 
and, as with the other 'good' edge spreadings, the BEC BP 
thresholds converge to the optimal MAP decoding thresholds 
for (3, 6)-regular ensembles (e = 0.4881). 

In this case, for i = 2, i?2 = 1/4, (5^L = 0.0950, and e* = 
0.6447. However, we note that for the circulant sub-ensemble, 
we obtain only dminQC < 30 for L > 2, a relatively small 
upper bound, which can be achieved for small circulant size 
N . The parity-check matrix given as an example in Section 
IIII-Al is a member of the QC sub-ensemble ^Bro ii (^)- Using 
circulants of size A^ = 38 in this parity-check matrix, we 
achieve dmin = 30 (this is a [152, 38, 30] binary linear code 
with girth 8). 

We now show that by taking rre-covers of this protograph we 
can increase the bound. For example, consider the following 
2-cover: 



K = 



1110 
110 1 



and B[ 



10 11 
111 



Using these component submatrices, we obtain the base ma- 
trix BJ_^ ^, for a (3,6)-regular LDPC convolutional code 
ensemble with syndrome former memory iris = 1- The termi- 
nated ensemble constructed from the component two-covers 
is denoted as ^b' (N). It follows that fe' (N) C 

^Bro t_ii(2^), because any A^-cover of an m-cover exists in 
the set of mA^-covers of the original protograph. Interestingly, 
we calculate the minimum distance growth rate (5)j^j„ = 0.095 
(and threshold e* = 0.6447) for both the original ensemble 
Cbto i_ii (N) and the two-cover ensemble ^b' (N). From 

this we conclude that typical codes with the same length from 
either ensemble would have the same minimum distance. 

This is clearly not the case for the QC sub-ensembles. It is 
a simple exercise to choose circulants so that a code C" from 
the quasi-cyclic sub-ensemble of the two-cover ^f?, (N) 

does not exist in the original QC sub-ensemble ^3 (2A^), 

and vice versa. Using the 2-cover component submatrices, the 
upper bound on the minimum distance of members of the QC 
sub-ensemble increases to dminQC < 82. The improvement 
can be verified quickly, since it is relatively easy to construct 
a code with minimum distance larger than 30 from this sub- 
ensemble. 

Moreover, by taking 3-covers of the component submatrices: 
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the resulting terminated ensembles also have minimum dis- 
tance growth rate (5!^]„ = 0.095 and threshold e* = 0.6447, 

oc 
yet we calculate dminoc < 210 for codes from ^i;,, (N) 

with L > 2. Comparing the value obtained for this 3- 
cover with Examples 2 and 3, which also have component 
submatrices of size 3x6 and rus = 1, we obtain the largest 
bound for the ensemble with the largest minimum distance 
growth rate. The improvement we observe by taking graph 
covers of the protograph can be attributed to permitting arrays 
of circulants to replace entries in the base matrix B. 

Table H] gives a summary of the results for Examples 1-4 
considered above. 



Example 


<5m,„ (R = 1/4) 


e* (R = 1/4) 


Upp. bnd. on dminQC 


1 


0.0815 


0.6353 


56 


2 


0.0920 


0.6471 


176 


3 


0.0296 


0.4949 


36 


4 (3-cover) 


0.0950 


0.6447 


210 



TABLE I: Comparison of 6min, BEC thresholds, and bounds 
on dminQC for several asymptotically regular (3, 6) families 

VI. Conclusions 

Asymptotically regular LDPC codes based on protographs 
have been shown to display capacity approaching iterative 
decoding thresholds with minimum distance that grows lin- 
early with block length. Both the minimum distance growth 
rate and threshold have been shown to depend closely on the 
choice of component protographs. In the interests of efficient 
implementation, this paper has explored the properties of the 
quasi-cyclic sub-ensembles of protograph-based codes. It was 



shown that, even though the members of the QC sub-ensemble 
are not typical members of the ensemble, the upper bound 
on the minimum Hamming distance of members of this sub- 
ensemble can be improved using choices of edge spreading 
that result in good ensemble minimum distance growth rates. 
In addition, the upper bound obtained for several of the 
examples presented here was shown to be achieveable by 
constructing codes with this minimum distance. Finally, we 
showed that the upper bound obtained for the QC codes in 
the ensemble can be improved by using arrays of circulants 
in a graph cover of the protograph. Due to space limitations, 
we have only presented results for edge spreadings of (3, 6)- 
regular base matrices B; however, similar results are observed 
for arbitrary J and K. 
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